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1  Abstract 

The  Haiti  earthquake  relief  effort  in  2010  involved  numerous  civil  and  military  organizations  engaged  in 
vigorous  relief  operations,  exemplifying  rapid  deployment  of  logistics,  transport,  security  and  medical 
supplies.  Yet,  to  determine  the  effectiveness  of  these  operations,  and  to  understand  how  these  various 
organizations  collaborated,  and  eventually  how  to  improve  the  future  overall  effectiveness  of  these 
operations,  requires  sifting  through  large  volumes  of  communications  and  determining  a  methodology  so  that 
analysts  may  understand  how  to  gain  understanding  and  analyze  such  data. 

We  have  been  leveraging  “Lexical  Link  Analysis”  (LLA),  as  one  means  to  achieve  “System  Self-awareness” 
(SSA),  and  develop  this  as  a  methodology  to  analyze  large  text-based  data  sets,  such  as  the  interagency 
collaborations  and  problem  solving  processes  using  documents  obtained  from  traditional  communication 
platforms  as  well  as  new  social  media.  Last  year,  we  reported  using  LLA  to  analyze  open  source  data 
sources  of  the  Haiti  relief  operations.  This  year,  we  will  report  on  the  analysis  applied  to  the  APAN  (The  All 
Partners  Access  Network)  forum  discussion  data  from  the  relief  effort. 


2  Introduction 

As  we  introduced  and  reported  previously  (Zhao,  etc.,  2011),  in  the  aftermath  of  the  Haiti  earthquake,  US 
military  and  civil  organizations  provided  rapid  and  extensive  relief  operations.  Organizations  involved  in  the 
operation  collaborated  with  each  other  to  negotiate  roles,  define  limits,  set  business  rules  etc.,  to  maximize 
efficiency  and  decrease  response  times.  To  determine  that  the  expectations  for  effective  and  efficient 
operations  had  been  attained  would  require  in-depth  analysis  of  collaboration,  dialog  and  resultant  outcomes. 
Such  analysis  by  manual  means  would  require  numerous  man-hours  and  arduous  effort. 

The  challenge  is  to  read  and  understand  the  data  that  are  collected  in  real-life  events  and  to  gain  insight  on 
what  the  roles  and  relationships  of  these  organizations  were  and  how  the  operations  were  actually 
conducted,  in  terms  of  how  various  organizations  (military  and  civil)  actually  interacted-  and  how  their 
synergies  were  achieved. 

In  a  technical  sense,  data  sources  for  analyzing  collaborations  among  military,  government  and  civil 
stakeholders  typically  include  unstructured  data  such  as  free  text,  word,  .pdf,  or  PowerPoint  documents  in 
collaboration  websites  such  as  APAN,  and  new  and  varied  platforms  of  communication  such  as  the  internet, 
intranet,  and  myriad  social  media.  The  data  has  been  overwhelming  yet  may  contain  the  critical  insights  and 
could  impact  and  help  the  management  of  future  operations.  Cross-examining  all  this  data  requires 
automated  tools  which  in  the  future  may  provide  real-time  analysis  when  applied  to  future  events. 

The  US  SOUTHERN  COMMAND  engaged  in  the  HAITI  Humanitarian  Assistance/Disaster  Relief  (HA/DR) 
Community  of  Interest  (COI)  on  the  All  Partners  Access  Network  (APAN)  during  the  Haiti  crisis.  The 
APAN  data  was  captured  from  an  MSSQL  database.  The  samples  from  the  data  include: 

•  Official  documents  and  briefings  in  PDL:  -167  PDL  file  attachments  related  to  HAITI  HA/DR  from 
1/13/2010  to  5/26/2010 


2 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 
VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 


1.  REPORT  DATE 

JUN  2012 


2.  REPORT  TYPE 


4.  TITLE  AND  SUBTITLE 

Semantic  and  Social  Networks  Comparison  for  the  Haiti  Earthquake 
Relief  Operations  from  APAN  Data  Sources  using  Lexical  Link  Analysis 
(LLA) 


6.  AUTHOR(S) 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS (ES) 

Naval  Postgraduate  School, Distributed  Information  Systems 
Experimentation  (DISE)  Group, Monterey, CA, 93943 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS  (ES) 


3.  DATES  COVERED 

00-00-2012  to  00-00-2012 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 


12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

Presented  at  the  17th  International  Command  &  Control  Research  &  Technology  Symposium  (ICCRTS) 
held  19-21  June,  2012  in  Fairfax,  VA. 

14.  ABSTRACT 

The  Haiti  earthquake  relief  effort  in  2010  involved  numerous  civil  and  military  organizations  engaged  in 
vigorous  relief  operations,  exemplifying  rapid  deployment  of  logistics,  transport,  security  and  medical 
supplies.  Yet,  to  determine  the  effectiveness  of  these  operations,  and  to  understand  how  these  various 
organizations  collaborated,  and  eventually  how  to  improve  the  future  overall  effectiveness  of  these 
operations,  requires  sifting  through  large  volumes  of  communications  and  determining  a  methodology  so 
that  analysts  may  understand  how  to  gain  understanding  and  analyze  such  data.  We  have  been  leveraging 
?Lexical  Link  Analysis?  (LLA),  as  one  means  to  achieve  ?System  Self-awareness?  (SSA),  and  develop  this 
as  a  methodology  to  analyze  large  text-based  data  sets,  such  as  the  interagency  collaborations  and  problem 
solving  processes  using  documents  obtained  from  traditional  communication  platforms  as  well  as  new 
social  media.  Last  year,  we  reported  using  LLA  to  analyze  open  source  data  sources  of  the  Haiti  relief 
operations.  This  year,  we  will  report  on  the  analysis  applied  to  the  APAN  (The  All  Partners  Access 
Network)  forum  discussion  data  from  the  relief  effort. 


15.  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 


a.  REPORT 

unclassified 


b.  ABSTRACT 

unclassified 


c.  THIS  PAGE 

unclassified 


17.  LIMITATION  OF 

18.  NUMBER 

ABSTRACT 

OF  PAGES 

Same  as 

42 

Report  (SAR) 

19a.  NAME  OF 
RESPONSIBLE  PERSON 


Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


17th  ICCRTS,  International  Command  and  Control,  Research  and  Technology  Symposium,  Fairfax,  Virginia,  June  19-21,  2012 


•  SITREP:  -150  Situation  Report  documents 

•  Forum:  -1400  posts  from  1/13/2010  to  6/3/2010 

•  Blogs:  -3900  blog  messages 


In  the  previous  ICCRTS  publication  (Zhao  et  ah,  2010),  we  leveraged  “Lexical  Link  Analysis”  (LLA)  and  a 
related  concept,  “System  Self-awareness”  (SSA)  to  analyze  the  open  sources  of  Haiti  operations  to  address 
the  business  questions.  In  this  current  paper,  we  seek  to  leverage  the  same  method  and  concept  but  focus  on 
the  APAN  forum  data. 

The  APAN  HA/DR  forum  data  originally  included  many  discussions  for  the  disasters,  threats,  and  risks  and 
was  not  limited  to  the  Haiti  earthquake,  we  applied  a  filter  to  the  posts  containing  a  small  word  list,  i.e. 

Haiti,  jacmel,  usns,  Port-au-Prince,  Pap,  etc.  to  exclude  the  ones  with  empty  message  bodies  or  posted  by 
anonymous  users,  which  resulted  in  the  data  set  of -1400  posts.  This  data  set  not  only  revealed  the 
collaborations  among  military,  government,  and  civil  stakeholders  in  the  crisis  via  social  networks,  it  also 
recorded  the  content  that  reflects  how  these  stakeholders  solved  problems  by  addressing  issues  raised  in  the 
thread  discussions.  A  discussion  thread,  for  instance,  started  when  a  person  raised  a  question,  and  other 
people  followed  up  with  answers  with  different  levels  of  detail. 


In  this  paper,  we  will  discuss  two  applications  of  using  LLA  and  the  APAN  forum  data  to: 

•  Discover  themes  and  topics  in  the  discussion  forum  and  sort  the  importance  of  the  themes  (Section 
3.2) 

•  Discover  social  and  semantic  networks  of  organizations  that  were  involved,  compare  the  two 
networks  to  obtain  insights  to  answer  the  following  questions: 

o  What  were  the  organizations  involved  in  the  important  themes  in  the  Haiti  operation 
(Section  3.3.2  Table  2,  3,  4  and  5)? 

o  How  do  semantic  networks  suggest  more  potential  collaboration  when  compared  to  social 
networks  (Section  3.3.2)? 

3  Approaches 

3.1  Lexical  Link  Analysis  (LLA) 

As  in  military  operations,  where  the  term  situational  awareness  is  coined,  we  note  that  that  our  efforts  can 
inform  awareness  of  analyzed  data,  in  a  unique  way  that  helps  improve  decision  maker’s  understanding  or 
awareness  of  its  content.  We  therefore  define  awareness  as  the  cognitive  interface  between  decision  makers 
and  a  complex  system,  expressed  in  a  range  of  terms  or  “features,”  or  specific  vocabulary  or  “lexicon,”  to 
describe  the  attributes  and  surrounding  environment  of  the  system.  Specifically,  LLA  is  a  form  of  text 
mining  in  which  word  meanings  represented  in  lexical  terms  (e.g.  word  pairs)  can  be  represented  as  if  they 
are  in  a  community  of  a  word  network.  Figure  1  demonstrates  an  example  of  LLA  with  connected  keywords 
or  concepts  extracted  from  the  documents  of  Maritime  Domain  Awareness  (MDA)  technologies.  Words  are 
linked  as  word  pairs.  Different  colors  indicate  different  clusters  of  word  groups.  In  Figure  2,  the  word 
“behavior”  is,  for  example,  centered  within  “suspicious,  bad,  dangerous,  abnormal,  usual,  and  anomalous” 
and  so  on,  showing  the  ways  to  describe  “behavior”  in  this  data  found  from  MDA. 
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Figure  2:  A  Single  Word  Hub  Showing  Linkage  (Detail  of  Figure  1  in  the  red  circle) 

The  detailed  steps  of  LLA  processing  include  applying  collaborative  learning  agents  (CLA)  and  generating 
visualizations,  including  a  lexical  network  visualization  via  AutoMap  (2009),  radar  visualization,  and  matrix 
visualization  (Zhao  et  al.  2010).  The  following  are  the  steps  for  performing  an  LLA: 

•  Read  each  set  of  documents. 

•  Select  feature-like  word  pairs. 
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•  Apply  a  social  network  community  finding  algorithm  (e.g.  Newman  grouping  method;  Girvan  et  al. 
2001)  to  group  the  word  pairs  into  themes.  A  theme  includes  a  collection  of  lexical  word  pairs 
connected  each  other. 

•  Compute  a  “weight”  for  a  theme  for  the  information  of  a  time  period,  that  is,  how  many  word  pairs 
belong  to  a  theme  for  that  time  period  and  for  all  the  time  periods. 

•  Sort  theme  weights  by  time,  and  study  the  distributions  of  the  themes  by  time. 

3.2  Discover  Themes 

Figure  3  shows  the  themes  discovered  from  the  APAN  forum  data  set  sorted  by  the  number  of  lexical  links 
(i.e.  word  pairs)  -  the  column  “LLA  Counts”  in  each  theme.  Figure  4  shows  the  lexical  links  in  the  theme 
associated  with  the  keywords  “WATER,  AIR,  FOOD.”  A  theme  is  summarized  generally  using  a  three-word 
list.  Each  list  of  words  becomes  a  word  hub  with  a  large  number  of  links  (i.e.  total  degree  as  a  centrality 
measure).  The  word  hubs  are  connected  at  most,  a  node  apart.  Sorted  themes  indicate  the  main  and  important 
topics  (themes)  discussed  in  the  APAN  forum.  We  further  divided  data  between  1/13/2010  to  6/30/2010  into 
one  week  as  a  period  after  the  event  until  4/3/2010.  The  data  after  4/3/2010  belongs  to  one  period.  The  dates 
are  the  first  days  of  these  weeks. 


Table  1  shows  the  distributions  of  LLA  counts  for  the  themes  associated  with  keywords  “WATER,  AIR, 
FOOD”  and  “PEOPLE,  CHILDREN,  POST”  according  to  these  weeks  divided,  indicating  these  activities 
were  continuously  performed  in  many  weeks  after  the  event. 


Event  Date  Sort 

Theme  Id 

LLA  Counts 

Theme  Keywords 

ALL 

95 

83 

WATER, AIR.FOOD 

ALL 

65 

67 

PEOPLE,  CHILDREN  .POST  ! 

ALL 

6 

65 

APANfCONTACTfUSER  j 

ALL 

people 

65 

HAITI  .OFFICER,  PRESIDENT 

ALL 

64 

60 

PAP, EMERGENCY, ASSESSMENT, ASSISTANCE 

ALL 

41 

58 

INFORMATIONXOMMUNITY,  REQUESTS 

ALL 

54 

56 

MEDICALJI  ME,  LIMITED 

ALL 

29 

56 

EARTHQUAKE,  LEFT,  MASSIVE,  QUAKE 

ALL 

46 

51 

P  ATI  E  NTS,  LO  N  G ,  CARE 

ALL 

70 

50 

RELIEF, FIELD,EFFORT 

ALL 

73 

50 

GROUP,  SEEpFRENCH 

ALL 

96 

50 

DE,POSTEDpINITIALp  WORKING 

ALL 

22 

50 

COUNTRYpHEALTH, AREAS,  CHILE 

ALL 

place 

49 

STATEpPORT  AU  PRINCE, CITY 

ALL 

78 

47 

BUILDING,  HQpCIVIL,  USERS 

ALL 

86 

44 

TEAMpCOORDINATI  ON,  LEAD 

ALL 

39 

42 

HOSPITALpBEDpHOUSE 

ALL 

82 

42 

SUPPORTpPERSONNELpCLIENTpUNIT 

Figure  3:  Sorted  Themes  for  All  the  Threads 
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Table  1:  Distributions  of  LLA  Counts  in  the  Weeks 
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Dates 

Theme  ID 

LLA  Counts 

Theme  Keywords 

ALL 

95 

S3 

WATER,  AIR,  FOOD 

1/20/2010 

95 

39 

WATER, AIR, FOOD 

1/13/2010 

95 

37 

WATER, AIR, FOOD 

1/27/2010 

95 

20 

WATER, AIR, FOOD 

3/10/2010 

95 

7 

WATER,AIR,FOOD 

2/10/2010 

95 

3 

WATER,AIR,FOOD 

2/17/2010 

95 

3 

WATER, AIR, FOOD 

2/24/2010 

95 

3 

WATER, AIR, FOOD 

2/3/2010 

95 

2 

WATER,  AIR.FOOD 

3/3/2010 

95 

2 

WATER, AIR, FOOD 

ALL 

65 

67 

PEOPLE, CHILDREN, POST 

1/20/2010 

65 

35 

PEOPLE, CHILDREN, POST 

1/13/2010 

65 

19 

PEOPLE, CHILDREN, POST 

1/27/2010 

65 

S 

PEOPLE, CHILDREN, POST 

2/10/2010 

65 

7 

PEOPLE, CHILDREN, POST 

3/3/2010 

65 

4 

PEOPLE, CHILDREN, POST 

2/24/2010 

65 

3 

PEOPLE, CHILDREN, POST 

2/3/2010 

65 

2 

PEOPLE, CHILDREN, POST 

2/17/2010 

65 

1 

PEOPLE, CHILDREN, POST 

3/10/2010 

65 

1 

PEOPLE, CHILDREN, POST 

3/21/2010 

65 

1 

PEOPLE, CHILDREN, POST 

The  themes  found  using  LLA  are  consistent  with  the  existing  doctrines  of  the  roles  of  military  and  non¬ 
government  organizations  (NGO)  might  engage  in  if  a  disaster  like  the  Haiti  earthquake  occurs.  For 
example,  United  States  Agency  for  International  Development  (USAID)  specifies  a  cluster  system  for  what 
various  organizations  should  perform,  where  military  is  employed  in  the  tasks  of  handling  logistics  and  relief 
supplies  (USAID,  2010). 


When  looking  at  the  detail  and  the  themes  produced  using  LLA,  the  actual  focuses  of  the  APAN  forum  were 
revealed.  Figure  5  shows  the  themes  sorted  using  the  numbers  of  lexical  links  in  the  first  and  second  weeks 
of  the  Haiti  operation  (1/13/2010  to  1/19/2010  and  1/20/2010  to  1/26/2010).  Compared  to  the  overall  themes 
in  Figure  3,  themes  on  the  first  two  weeks  were  different,  indicating  priorities  in  the  initial  phase  of  the  relief 
effort  were  different  than  in  the  total  relief  effort.  Figure  7  and  Figure  6  show  the  lexical  links  for  the  themes 
associated  with  the  keywords  “HIGH,  RATES”,  “AIRCRAFT,  GROUND,  ASSIST,  ORDERED, 
USNSCOMFORT”  and  “SUPPORT,  PERSONNEL,  CLIENT,  UNIT”  are  not  found  among  the  top  ten  themes 
in  Figure  3.  These  were  examples  of  the  themes  that  were  unique  to  the  first  week.  For  example,  the  theme 
“HIGH,  RATES”  was  related  to  the  fact  that  the  US  government  provided  the  analysis  and  information  about 
high  fatalities,  high  risks  and  high  disease  rates  that  might  be  expected  after  the  event.  This  suggests  these 
important  issues  were  heavily  discussed  and  collaborated  in  the  time  of  frame  of  two  weeks.  The  focuses  of 
the  APAN  forum  activities  have  been  shifted  from  the  logistics  centric  first  week  to  medical  and  patient 
centric  second  week  in  the  context  of  the  overall  efforts. 
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Event  Date  Sort 

Theme  Id 

LIA  Counts 

Theme  Keywords 

1/13/2010 

ALL 

630 

1/13/2010 

95 

37 

WATERfAIR,FOOD 

1/13/2010 

37 

25 

HIGHPRATES 

1/13/2010 

5 

22 

AIRCRAFT, GROUNDpASSlSTsORDEREDpUSNS  COMFORT 

1/13/2010 

6 

21 

A  PAN,  CONTACT,  USER 

1/13/2010 

S2 

21 

SUPPORTpPERSONNELpCLIENTpUNIT 

1/13/2010 

65 

19 

PEOPLEpCHILDRENpPOST 

1/13/2010 

people 

IS 

HAITI  .OFFICER,  PRESIDENT 

1/13/2010 

64 

IS 

PAPpEMERGENCY.  ASSESSMENT,  ASSISTANCE 

1/13/2010 

79 

IS 

STA  F  F,  V  E  RS 1  ON,  BATTLE,  LO  GS,  RECORDS 

1/13/2010 

69 

IS 

PROVIDE,  DEAD,NEEDED 

1/13/2010 

39 

17 

HOSPITALpBEDpHOUSE 

Even:  Date  Sort 

Theme  Id 

LLA  Counts 

Theme  Keywords 

1/20/2010 

95 

39 

WATER, AiRpFOQD 

1/20/2010 

65 

35 

PEOPLE,  CHILDREN,  POST 

1/20/2010 

people 

33 

HAITI,  OFFICERpPRESIDENT 

1/20/2010 

46 

32 

PATIENTSpLONIGXARE 

1/20/2010 

29 

26 

EARTHQUA<E,LE=T,MA55IVE,QUA<E 

1/20/2010 

54 

25 

MEDICALpTIMEpLIMITED 

1/20/2010 

41 

24 

INFORMATIONpCOMMUNITY,  REQUESTS 

1/20/2010 

6 

23 

A  PA  N,CO  NT A  CT,  USER 

1/20/2010 

70 

23 

RELIEF, FIELD, EFFORT 

1/20/2010 

72 

23 

RESPONSE,  HELlCOPTERpPART 

Figure  5:  Themes  sorted  for  the  first  week  starting  on  1/13/2010  (to  1/19/2010)  and  the  second  week  starting 

on  1/20/2010  (to  1/26/2010). 


Figure  6:  Lexical  links  for  the  theme  associated  with  keywords  “HIGH,  RATES”. 
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Figure  7:  Lexical  links  for  the  theme  associated  with  keywords  “AIRCRAFT,  GROUND,  ASSIST, 

ORDERED,  USNS  COMFORT”. 


Figure  8  shows  the  overlapping  of  top  ten  themes  sorted  by  the  LLA  counts  overall  (ALL)  and  the  first 
(Week  1)  and  second  weeks  (Week  2).  Our  observations  reveal  that: 

•  Six  out  of  ten  top  themes  (60%)  in  the  top  themes  of  all  are  also  in  the  top  themes  of  the  first  weeks. 
Likewise,  seven  of  ten  top  themes  in  the  first  two  weeks  are  also  in  the  top  overall  themes,  indicating 
the  LLA  counts  are  indeed  valid  measures  to  sort  out  the  main  and  important  topics  and  themes.  In 
this  example,  the  important  topics  were  handled  in  the  first  two  weeks  of  the  event. 

•  Some  of  the  important  topics  or  themes  which  are  picked  out  by  the  LLA  counts  are  not  shown  in  the 
first  two  weeks.  For  example,  “MEDICAL,  TIME,  LIMITED”  and  “PAP,  EMERGENCY, 
ASSESSMENT,  ASSISTANCE”  may  not  be  feasible  to  address  as  the  high  priority  items  for  the 
first  two  weeks.  While  the  themes  “INFORMATION,  COMMUNITY,  REQUESTS”  and  “RELIEF, 
FIELD,  EFFORT”  can  be  themes  that  are  important  across  all  the  time  frames. 


Theme  Keywords 

Theme  Keywords 

\\\ 

WATER,  A 1 R,  F  0  0  D 

Week  1 

WATER,AIR,FOOD 

PEOPLE,  CHILDREN  .POST 

HIGH,RATES 

A  PAN,  CONTACT,  USER 

AIRCRAFT,  GROUND,  ASSIST, ORDERED,USNS COMFORT 

HAITI,OFFICER,  PRESIDENT 

A  P  AN,CO  NTACT,  US  ER 

PA  P,  EM  ER  G  EN  CY,  ASS  ESS  M  ENT,  ASS  1ST A  NCE 

SUP  PO  RT,  P  ERS  0  N  N  EL,  CLI  ENT,  UNIT 

1 N  F  0  R  M  ATI  0  N,COM  M  U  N ITY,  R  EQU  ESTS 

Week  2 

WATER,AIRPFOOD 

M  ED  1 CA  L,TI  M  E,  LI  M ITED 

PEOPLE,  CHILD  REN,  POST 

EARTHQUAKE,  LEFT,  MASSIVE,  QUAKE 

HA  ITI,OFF  ICER,  P  R  ESI  D  ENT 

P  ATI  E  NTS,  LO  N  G,  CARE 

PATIEMTS,LGNG,CARE 

RELIEF,FIELD,EFFORT 

EARTH  QUAKE,  LEFT,  MASSIVE,  QUAKE 

Figure  8:  The  overlapping  of  top  ten  themes  sorted  by  the  LLA  counts  overall  (ALL)  and  the  first  and  second 
weeks. 
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Much  of  text  analysis  depends  on  initially  searching  the  available  internet.  At  this  point,  our  efforts  are 
sometimes  compared  to  those  of  a  typical  search  engine.  One  of  the  disadvantages  of  conventional  search 
engines  is  that  they  typically  sort  documents  based  on  the  popularity  of  documents  among  linked  documents, 
not  based  on  semantics.  Therefore,  it  does  not  satisfy  complete  search  needs  nor  determine  relevance  if  the 
links  among  the  documents  are  not  available.  For  example,  the  content  in  the  forum  is  not  cross-linked, 
therefore,  the  discovered  or  revealed  topics  or  themes  cannot  be  found  as  prioritized  results,  if  conventional 
search  engines  are  used. 


The  LLA  approach  is  more  properly  related  to  Latent  Semantic  Analysis  (LSA;  (Dumais,  Furnas,  Landauer, 
Deerwester,  &  Harshman,  1988;  Gorman,  2003;  Letsche,  1997)  and  Probabilistic  Latent  Semantic  Analysis 
(PLSA).  In  the  LSA  approach,  a  term-document  matrix  is  the  starting  point  for  analysis.  The  elements  of  the 
term-document  or  feature-object  (term  as  feature  and  document  as  object)  matrix  are  the  occurrences  of  each 
word  in  a  particular  document,  i.e.  A  =  [ a ij],  where  atj  denotes  the  frequency  in  which  term  j  occurs  in 
document  i.  The  term-document  matrix  is  usually  sparse.  LSA  uses  singular  value  decomposition  (SVD)  to 
reduce  the  dimensionality  of  the  term-document  matrix.  SVD  cannot  be  applied  to  the  cases  where  the 
vocabulary  (the  unique  number  of  terms)  in  the  document  collection  is  large.  LSA  has  been  widely  used  to 
improve  information  indexing,  search/retrieval  and  text  categorization. 


A  recent  development  related  to  this  method  is  called  Latent  Dirichlet  allocation  (LDA;  Blei,  Ng,  &  Jordan, 
2003),  which  is  a  generative  probabilistic  model  of  a  corpus.  In  LDA,  a  document  is  considered  to  be 
composed  of  a  collection  of  words — a  “bag  of  words,”  where  word  order  and  grammar  are  not  considered 
important.  The  basic  idea  is  that  documents  are  represented  as  random  mixtures  over  latent  topics,  where 
each  topic  is  characterized  by  a  statistical  distribution  (Dirichlet  distribution)  over  the  corpus.  Our  theme 
generation  from  LLA  is  different  than  LDA,  in  which  a  collection  of  lexical  terms  are  connected  to  each 
other  semantically,  as  if  they  are  in  a  social  community,  and  social  network  grouping  methods  are  used  to 
group  the  words.  Our  method  is  easily  scaled  to  analyze  a  large  vocabulary  and  is  generalizable  to  any 
sequential  data. 

3.3  Semantic  and  Social  Networks 

3. 3. 1  Extracting  Social  Networks 

Current  research  of  social  network  analysis  mostly  focuses  on  using  a  simple,  weighted,  or  arrowed  links 
between  two  objects,  such  as  people  or  organizations,  regardless  of  the  contents  (Wasserman,  1994).  The  so- 
called  study  of  centrality  (Girvan  2002)  has  been  a  focal  point  for  the  social  network  structure  studies  in 
recent  years.  Finding  the  centrality  of  a  network  lends  insight  into  the  various  roles  and  groupings,  such  as 
the  connectors  (for  example,  mavens,  leaders,  bridges,  isolated  nodes),  the  clusters  (and  who  is  in  them),  the 
network  core,  and  its  periphery. 

We  extract  the  social  networks  based  how  an  organization  was  engaged  in  a  discussion  of  an  issue,  i.e.,  a 
discussion  thread.  For  example,  in  Figure  9,  Organization  “DoD  Air  Force,”  ”SSSG,”  etc.  were  engaged  in 
the  discussion  in  Thread  5741,  regarding  the  subject  “mtpp  airfield  sitrep”,  therefore,  we  assume  these 
organizations  were  working  together  and  formed  social  network  connections. 

Figure  10  shows  an  organization  and  thread  matrix  which  lists  how  many  times  an  organization  was  involved 
in  the  thread  discussion.  This  matrix  was  then  used  to  generate  the  social  networks  of  participating 
organizations  -  one  organization  was  connected  to  another  if  they  were  both  involved  in  at  least  one  of  the 
same  thread  discussions. 


With  the  themes  separated  from  the  theme  discovery  process,  we  were  able  to  look  at  the  organization 
connections  within  each  theme.  While  traditional  social  networks  typically  look  at  a  social  network  as  a 
whole,  Figure  1 1  shows  a  social  network  of  the  organizations  involved  in  the  Haiti  operation  only  for  the 
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theme  “WATER,  AIR,  FOOD”.  The  light  blue  connections  represent  the  social  network  connections  that  are 
overlapped  with  the  semantic  networks,  indicating  that  these  organizations  not  only  appeared  together  in  the 
same  threads,  but  also  discussed  concepts  and  contents  similar  to  each  other  which  were  the  basis  to  form 
semantic  connections  (see  Section  3.3.2.).  Figure  12  shows  the  organizations  illustrated  in  a  node  cloud, 
where  the  ones  with  larger  fonts  are  the  organizations  with  higher  centralities  measured  in  total  degrees  in 
Figure  11. 


1 

Thread 

Level 

Time 

User 

Organization 

Content 

t-3 

5722 

2 
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Figure  9:  An  APAN  forum  discussion  thread  example 
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Figure  10:  Organization  and  thread  matrix 
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Figure  1 1 :  Social  networks  associated  with  the  theme  “WATER,  AIR,  DATA.”  The  light  blue  links  are  the 
social  network  links  that  are  also  overlapped  in  the  semantic  networks. 
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Figure  12:  Organizations  illustrated  in  a  node  cloud:  the  ones  with  larger  fonts  are  the  organizations  with 
higher  centralities  (total  degrees)  as  found  in  Figure  11. 
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3. 3. 2  Extracting  Semantic  Networks 

A  semantic  network  is  not  necessarily  a  real  connection  between  two  organizations  in  real-life, 
instead,  it  is  a  network  measuring  how  much  similarity  of  content  that  two  organizations  present  in 
the  forum,  a  measure  of  like-mindedness.  It  is  reasonable  to  assume  that  like-minded  organizations 
do  use  many  of  the  same  terminologies  in  their  communications.  For  that,  like-mindedness  is  closer 
in  meaning  to  like-wordedness,  which  is  the  reason  to  cluster  them  together.  In  this  case,  by  using 
LLA,  a  semantic  network  for  two  organizations  can  be  generated  by  counting  the  common  word 
hubs  in  each  theme  for  the  contents  of  two  organizations.  In  Figure  13,  Organization 
“PrivateUSACopemicusEnergylnc”  shares  the  word  hubs  with  various  other  organizations  such  as 
“MilitaryUSAArmy”  and  “MilitarySouthAfricaArmyAZ”.  Word  hubs  in  common  are  “Information- 
“Small—”  and  “Power—”,  etc.  The  words  such  as  “_Energy”  and  “_Engineering”  attached  next 
to  the  organizations  are  the  self-claimed  function  areas,  possibly  reflecting  their  motivations  to  join 
the  discussions.  The  weights  next  to  the  word  hubs,  such  as  “Information—”,  “Small—”  and  “Power- 
-”,  are  the  numbers  of  lexical  links  in  the  themes  to  which  the  word  hubs  belong.  They  indicate 
how  important  these  word  hubs,  or  semantic  connections  are  for  the  two  organizations,  as  they  are 
sorted  in  Figure  13.  In  this  example,  the  word  hub  “Power—”  has  the  highest  weight  302.0.  We  also 
submit  that  some  like-minded  organizations  might  use  different  terminologies  in  their  activities. 

For  example,  the  military  and  emergency  response  organizations  often  use  different  words,  yet  for 
the  same  meaning  and  organizations.  The  military  especially  seems  to  use  many  acronyms  instead 
of  words.  In  this  case,  LLA  often  places  the  different  terminologies  in  the  same  theme  although  they 
do  not  share  the  same  word  hubs  or  appear  in  disconnected  components  of  word  networks  such  as 
“CVN”  and  “Carl_  Vinson”  shown  in  Figure  7.  In  this  case,  the  similarity  of  two  organizations  can 
be  counted  by  the  words  in  the  same  themes.  This  paper  only  addresses  the  like-mindedness  in  the 
former  case  using  similarities  of  word  hubs. 
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Figure  13:  Using  LLA  to  form  semantic  network  connections 


Figure  14  shows  the  social  networks  (blue)  in  Figure  1 1  overlaid  with  the  semantic  networks  (green)  which 
are  associated  with  the  theme  with  the  keywords  “WATER,  AIR,  FOOD.”  The  light  blue  connections  are  the 
ones  overlapped  in  the  two  networks  analyzed. 


Note  that  in  Figure  14,  a  few  nodes  with  only  semantic  connections,  i.e.  only  green  links  can  be  found:  These 
are  the  organizations  which  had  no  social  connections  with  other  organizations  in  the  theme,  however,  they 
had  discussion  content  which  are  similar  to  the  ones  of  other  organizations,  therefore,  they  are  the  potential 
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high-value  targets  to  be  included  in  the  collaborations.  Figure  15  shows  the  word  hubs  which  are  the  basis  for 
one  of  the  semantic  network  nodes:  PrivateUSAStrategosSecurityContracting. 
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jtfryUSASOUTHCOM_ManagementSOUTHCOM 
£torCaribbeanMissions_ 

MationalDefenseUniversity_ 


Figure  14:  Semantic  networks  (green)  overlaid  on  the  top  of  the  social  networks  for  the  theme  associated 
with  keywords  “WATER,  AIR,  FOOD”. 
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Figure  15:  The  word  hubs  for  PrivateUSAStrategos  SecurityContracting  in  the  semantic  network. 

Table  2  shows  the  Organizations  involved  in  the  theme  “WATER,  AIR,  FOOD”.  The  same  analyses  can  be 
done  for  other  themes.  Table  3,  Table  4  and  Table  5  show  the  organizations  involved  in  the  three  other 
important  themes  “HIGH,  RATES”,  “AIRCRAFT,  GROUND,  ASSIST,  ORDERED,  USNSCOMFORT”,  and 
“MEDICAL,  TIME,  LIMITED”.  The  highlighted  ones  are  the  ones  suggested  by  semantic  networks,  which 
have  no  social  network  connections  with  other  organizations. 
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Table  2:  Organizations  involved  in  the  theme  “WATER,  AIR,  FOOD” 


GovernmentUSADoDJT 

M  i  1  it  a  ry  S  o  ut  h  A  f  ri  ca  A  rmy  ZA 

Military USAAirForce Evacuation 

Military  USAAirForce GIS 

Military  US  A  A  rmy 

Military  USAArmy Engineering 

MilitaryUSABoo:AllenHamilton DoDJoint 

Military  USADoDJjinnulla'ionJoint 

Militan/USANationalDefenseUniversity 

MilitaryUSANavy 

Military  USANavy Command 

Military  USANavyJnformationManagement 

Military  USAPACOM APAN 

Military  USAPACOMJrTFormationManagerrient 

Military  USAPACOM Simulation 

Military  USASOUTHCOM 

Military  USASOUTFiCOMJnformationManagement 

Military  USASOUTHCOM ManagementSOUTHCOM 

Military  USASOUTHCOM Partnering 

Military  USASOUTHCOM Planning 

Military  USASOUTHCOM Security 

NGOHaitiHaitiMissioriesofCharity 

NGOUSAFieldDirectorCaribbeanMissions 

NGOUSAG056TheMITRECorpora-ion  TIS 

KGOUSAHillsideChurch ChurchMission 

NGOUSAinSTEDD PublicHeal1hDisasierResponse 

NGOUSAProjectKIDInc Adopiion 

NGOUSATheScarletProjeci Humani1arianAssistance 

NGOUSAWorldCares DisasterResponseNetworks 

:t  vaieGereva  r 7 err  a:  :  "=  Cc:ec-a  ^c Trace  -“saec:  c- 

Priva:eHai1iHopita[SacreCoeur Medtcal 

PrivatGUKCTOSkyVisionGlobalNetworksUK TelecommunicationUK 

PrivaieUSACopernicusEnergylnc  Er ergy 

Priva"eUSAEmicuscom DisasterResponse 

PrivateUSAGloballnstituteForDisruptiveEventsGIDEJnfectiousDiseaseMedical 

PrivaieUSARapidResponseConsulting TelecomnnunicationsDisasterResponse 

-r  va:e „SAS:ra:eeos_5ecLr  :yCor:rac:  rg 
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Table  3:  Organizations  involved  in  the  theme  “HIGH,  RATES” 


Government  USADoD_ 

GovernmentUSADoDJT _ 

Military  USANationalDefenseUniversity_ 

Military  USAPACOMJnfomnationManagement _ 

Military  USAPACOM_Simulation 

Military  USASOUTHCOM_ _ 

PrivateUSACopernicusEnergylntjEnergy 

PnvaieUSADigitalGlobelnc GISInternationalDefenseSales 

PrivaieUSAERDASInc  GIS 


Table  4:  Organizations  involved  in  the  theme  “AIRCRAFT,  GROUND,  ASSIST,  ORDERED, 

USNS  COMFORT 


EducationUSAUniversityofHawaijDisasterResponse 

GovernmentUSADoDJT 

MilitarySouthAfricaArmyZA 

MilitaryUSAAirForce Evacuation 

MilitaryUSAAirForce GIS 

Military  USAAirForce M  an  age  merit  Airforce 

MilitaryUSABoo:AllenHamilton DoDioint 

MilitaryUSADoD GIS 

MilitaryUSAPACOM APAN 

Military  USAPACOMJnformationManagement 

MilitaryUSASOUTHCOM 

Military  USASOUTHCOMJnformationManagement 

MilitaryUSASOUTHCOM Planning 

NGOUSAHillsideChurch ChurchMission 

NGOUSAProjectKIDInc Adop:ion 

NGOUSAWorldCares DisasterResponseNetworks 

Priva:eUSAASIGroup_Securiiy  ! 

PrivateUSAGloballnstituteForDisruptiveEventsGIDE_lnfectiousDiseaseMedica;l  ? 

PrivateUSARapidResponseConsulting_TelecommunicationsDisasterResponse 
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Table  5:  Organizations  involved  in  the  theme  “MEDICAL,  TIME,  LIMITED” 


Govern  nnentUSADoD 

GovernmentUSADoDJT 

GovemmenfUSADoD Join1Airliff 

GovernmentUSADoD JointCivillnformationManagement 

MilitaryUSAAirForcejGIS 

Military  USAArmy MedicineSurgeonGeneral 

MilitaryUSADoDJnformationManagementJoint 

Military  USADoD SinnulatioriJoint 

MilitaryUSANationalDefenseUniversity 

V  :arylSA\avy^o  r: 

MillitaryUSAPACOM APAN 

Military  USAPACOMJnformationManagement 

MilitaryUSAPACOM Simulation 

Military  USASOUTHCOM 

MilitaryUSASOUTHCOMJnformationManagement 

Military  USASOUTMCOMJntelligence 

Military  USASOUTHCOM Security 

NGOHaitiHaitiMissioriesofCharity 

NGOUSAHillsideChurchjChurchMission 

NGOUSAPartnersinHealth Logi  sties 

NGOUSAProjectKIDInc Adoption 

NGOUSAWorldCares DisasterResponseNetworks 

PrivateHaitiHopitalSacreCoeur Medical 

-  r  va:elSAG  oba  l-ea  :h  r  :  a:  ves  rcjvec  c  re 

Private  USARapidResponseCon5ultingy_Tele€ommunicationsDisasterResponse 

The  core  innovation  here  is  to  use  the  LLA  method  to  analyze  the  forum  content,  therefore  generate 
alternative  networks,  i.e.  semantic  networks  that  can  be  used  to  compare  with  the  social  networks.  When 
comparing  semantic  networks  with  the  social  networks,  we  showed  how  to  measure  synergies  among 
organizations  and  how  the  synergies  can  be  varied  among  different  themes.  We  also  showed  the  potential  to 
rank  high-value  organizations  earlier  in  the  collaboration  process  based  on  their  common  interests  from  the 
semantic  network  analysis  for  the  discovered  themes. 

4  Additional  Observations 

As  our  goal  is  to  use  LLA  to  understand  the  synergy,  efficiency,  and  competency  of  an  interagency  operation 
that  involves  many  organizations,  we  use  lexical  links  as  metrics  to  measure  synergy-like  values.  One  of 
such  synergy  is  measured  using  the  ratio  of  the  links  in  the  overlapped  parts  of  social  networks  and  semantic 
networks. 
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Figure  16  shows  a  holistic  view  of  social  networks  and  semantic  networks  for  the  organizations  associated 
with  the  theme  “WATER,  AIR,  FOOD”,  where  the  red  connections  linking  the  organizations  that  have 
content  related  to  the  theme.  In  Figure  17,  the  table  shows  the  summary  of  connection  counts  for  the  top  ten 
themes  found  in  the  overall  forum  and  on  the  first  two  weeks  (highlighted  in  yellow).  The  table  is  sorted 
according  to  the  Synergy  column,  computed  as  the  ratio  of  from  the  columns  “Social  (Blue  and  Light  Blue)” 
and  “Overlapped  (Light  Blue)”.  It  also  lists  the  number  of  potential  semantic  links  (green)  and  number  of 
total  organizations  (red)  involved  in  the  theme.  Our  observations  are  summarized  as  follows: 

1 .  There  are  fewer  number  of  organizations  involved  on  the  two  weeks  than  in  the  overall  forum,  where 
the  synergies  seem  go  above  the  average  (>80%)  for  some  specific  and  time-critical  themes,  for 
example,  “HIGH, RATES”,  “PEOPLE, CHILDREN, POST”,  “AP AN, CONTACT, USER”, 

“AIRCRAFT,  GROUND , ASSIST,  ORDERED, USNSCOMFORT” ,” 

SUPPORT, PERSONNEL, CLIENT, UNIT”,  and  “EARTHQUAKE, LEFT, MASSIVE, QUAKE” 

2.  There  are  some  themes  with  relatively  low  synergies  overall  and  on  the  two  weeks,  for  example, 
“WATER,  AIR,  FOOD”,  “PATIENTS,  LONG,  CARE”, ’’HAITI,  OFFICER,  PRESIDENT”.  This 
may  indicate  lack  of  knowledge  or  expertise  in  these  areas  in  general. 

3.  Comparing  the  two  weeks  and  the  average,  the  synergies  among  organizations  seem  correlated  with 
the  number  of  organizations  involved  in  the  collaborations:  the  themes  with  fewer  organizations 
involved  have  higher  synergies. 
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Figure  16:  A  holistic  view  of  social  network  and  semantic  networks  for  the  organizations  associated  with  the 
theme  “WATER,  AIR,  FOOD”. 
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90  298 

33 
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Yes 

PAP,  EMERGENCY, ASSESSMENT,  ASSI  STANCE 

252 

174  312 

37 

69% 

NO 

Figure  17:  Synergy  measured  using  the  ratios  of  overlapping  connections  in  the  social  networks  and  semantic 
networks. 

5  Conclusion 

In  this  paper,  we  explored  a  real-life  data  set,  i.e.  APAN  forum  data  in  the  Haiti  interagency  operation  to 
illustrate  using  the  Lexical  Link  Analysis  1)  to  discover  the  themes  and  sort  them  according  to  the 
importance  and  time;  2)  to  extract  social  and  semantic  networks  of  participating  organizations. 

In  summary,  the  LLA  method  can  be  used  as  a  tool  to  examine  historic  data  and  improve  future  operation; 
identify  partners  for  collaboration  or  areas  of  overlap,  for  example,  as  a  monitoring  tool  to  visualize 
organizational  behavior  and  examining  the  aspects  of  the  situational  awareness  of  an  interagency  operation  in 
real-time.  Automation  can  be  achieved  via  computer  programs  or  software  agents  such  as  CLA  to  perform 
this  task  frequently  -  and  in  near  real-time,  thus  placing  “sifted”  data  into  the  hands  of  analysts  allowing 
them  the  opportunity  to  understand  revealed  themes  and  evolving  trends  contained  within  the  data  that 
heretofore  had  been  buried  and  likely  left  poorly  understood. 
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Distributed  Information  Systems  Experimentation  (DISE) 


Research  Questions 


DISE 


•  How  can  we  discover  themes  and  topics  in  the 
discussion  forum  and  sort  the  importance  of  the 
themes? 

•  How  can  we  discover  social  and  semantic  networks 
of  organizations  that  were  involved,  compare  the 
two  networks  to  obtain  new  insights? 

-  What  were  the  organizations  involved  in  the  important 
themes  in  the  Haiti  operation? 

-  How  do  semantic  networks  offer  more  potential 
collaboration  when  compared  to  social  networks? 
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HAITI  Project 


•  Data  Sources 

-  ~2600  open  source  web  pages 

•  Used  High  Performance  Computer  (HPC)  nodes. 

•  News  feed  from  1/13  -  2/23/2010 

•  Twitter  as  a  starting  point 

-  SOUTHCOM,  USAID  and  others  used  Twitter  to  handle  the  situations 

—  US  SOUTHERN  COMMAND  also  engaged  the  HAITI  HA/DR  Community  of  Interest 
(COI)  on  the  All  Partners  Access  Network  (APAN)  during  the  Haiti  crisis.  The  APAN  data 
was  captured  from  a  MSSQL  database.  The  sources  were: 

•  Official  documents  and  briefings  in  PDF:  ~167  PDF  file  attachments  related  to  HAITI  HA/DR  from 
1/13/2010  to  5/26/2010 

•  SITREP:  ~150  Situation  Report  documents 

•  Forum:  ~1173  posts  from  1/13/2010  to  6/3/2010 

•  Blogs:  ~3900  blog  messages 
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Lexical  Link  Analysis 


•  Automatically  discovers  word  pairs  and 
groups  using  social  network  community 
finding  algorithm  Newman  community 
finding  algorithm  -  words  as  in  a 
community  (Girvan  &  Newman,  2002) 


•  Text  analysis  method 

•  Bi-gram  co-occurrence 

•  Text-as-networks 
(semantic  networks) 
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•  Linguistics  based 

-  Part-of-speeches  (e.g.  nouns,  verbs):  SAP  Business  Objects/InXight 


NEE 


-  Linguistics  rules 

•  Statistical  Co-occurrence 

-  Tf*idf  (http://en.wikipedia.org/wiki/TPidf) 

-  Bi-gram,  N-gram 

•  Representation 

-  Bag-of-words  (BOW),  Latent  Semantic  Analysis  (LSA),  Latent 
Dirichlet  Allocation  (LDA),  Support  Vector  Machines  (SVM,  accurate  on 
text  categorization,  cloud/large-scale  implementation  example  like 
yottamine) 

-  Text-as-networks  (TAN):  Centering  Resonance  Analysis  (CRA),  Stanford 
Lexical  Parser  (syntactic  dependence),  AutoMap  , Google  PageRank 
(DAN,  Document-as-Networks), 


Distributed  Information  Systems  Experimentation  (DISE) 


Unique  Features  of 
System  2 


Unique  Features 
System  1 


Lexical  Links  between 
System  1  and  2 
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Term  Evaluation 


Once  the  document  is  constructed  as  a  network,  the  next  step  is  to  evaluate  the  nodes  to 
determine  the  importance  of  each  term.  To  do  this,  we  employ  the  concept  of  node  centrality. 

The  centrality  of  a  node  is  a  measure  of  how  important  it  is  compared  to  the  other  nodes  in  the 
network. 

There  exists  a  myriad  of  different  node  centrality  measures.  These  can  roughly  be  divided  into  four 
main  categories  as  illustrated  by  the  figure  below. 


Out-degree 


/—(  Current-flow  betweenness  I 
Betweenness  ^  , 

Closeness  } — '  Current-flow  closeness  I 


The  simplest  category  is  the  degree-based  centrality.  These  define  node  centrality  in  terms  of  the 
number  and  strengths  of  connections  between  a  node  and  its  neighbors. 

The  eigenvector  centralities  are  also  based  on  the  node's  neighbours,  but  is  a  bit  more  complex. 
These  measures  capture  not  only  the  number  of  neighbors  a  node  has,  but  also  take  into  account 
the  importance  of  each  neighbour.  This  group  include  among  others,  Google's  famous  PageRank 
algorithm. 

The  groups  of  closeness  and  betweenness  centrality  measures  are  a  bit  different,  in  that  they 
focus  more  on  the  overall  network  structure.  Closeness  centralities  are  defined  in  terms  of  the 
lengths  of  the  shortest  paths  from  a  node  to  the  rest  of  the  nodes  in  the  network.  Betweenness 
centrality  describes  whether,  and  how  frequently,  a  node  is  part  of  the  shortest  paths  between 
pairs  of  other  nodes  in  the  network. 

In  an  attempt  to  find  a  node  centrality  measure  well  suited  for  capturing  term  importance,  we 
experimented  with  all  the  measures  shown  in  the  figure  above.  The  figure  below  shows 
classification  accuracies  when  using  representations  based  on  different  centrality  measures. 
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rmtSUNTlA  PER  SCIENTIAmI 


•  Critical  needs 

•  Links  among 
documents/categories  of 
information  do  not  usually  exist 

•  Variety  of  network  centrality 
measures  can  be  used  for  term 
evaluations 

LLA  automatically  discovers  relations  among 
categories  of  text  information  to  facilitate 
search,  prioritization,  prediction  and  expertise 
location 

-  Generate  hyperlinks  for  intranet  documents 

LLA  can  also  focus  on  innovations  and 
uniqueness  of  the  analyzed  documents 

-  Other  ranking  techniques  which  typically  sort  documents 
based  on  the  popularity  or  authority ,  are  not  based  on 
semantics 

•  E.g.  PageRank  by  Google 
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Event  Date  Sort 

Theme  Id 

LLA  Counts 

Theme  Keywords 

ALL 

95 

83 

WATER,  AIR,FOOD 

ALL 

65 

67 

PEOPLE,  CHiLDREN,POST  ; 

ALL 

6 

65 

APAN,COI\  TACT,  USER 

ALL 

people 

65 

HAITI^OFFICER,  PRESIDENT  | 

ALL 

64 

60 

PAP,  EMERGENCY,  ASSESSMENT,  ASSISTANCE 

ALL 

41 

58 

INFORMATIQN,COMMUNITY,  REQUESTS  ] 

ALL 

54 

56 

MEDICAL, Tl  ME,  LIMITED  ! 

ALL 

29 

56 

EARTHQUAKE,  LEFT,  MASSIVE,  QUAKE  | 

ALL 

46 

51 

PATIENTS,  LONG,  CARE  | 

ALL 

70 

50 

RELIEF, FIELD, EFFORT 

ALL 

73 

50 

GROUP, SEE, FRENCH 

ALL 

96 

50 

DE, POSTED,  INITIAL, 'WORKING 

ALL 

22 

50 

COUNTRY,HEALTH,AREAS, CHILE 

ALL 

place 

49 

STATE, PORT  AU  PRINCE, CITY 

ALL 

78 

47 

BUILDING,HQ,CIVIL,USERS 

ALL 

86 

44 

TEAM,COORDINATION,LEAD 

ALL 

39 

42 

HOSPITAL, BED, HOUSE 

ALL 

82 

42 

SUPPORT,PERSONNEL,CLIENT,UNIT 
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DISE 


A  "Theme"  Defined 


r P^ST^NTimRSClENT/^jj 


(J1L 


A  collections  of  word  pairs  for  the  theme  associated  with  keywords  “WTER,  AIR,  FOOD” 
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DISE 


Distributions  of  LLA  Counts  per  Week 


Date  5 

Theme  ID 

LILA  Counts 

Theme  Keywords 

ALL 

95 

S3 

WATER  .AIR,  FOOD 

1/20/2010 

95 

39 

WATER,AIR,FQQD 

1/13/2010 

95 

37 

WATER,AIR,FOOD 

1/27/2010 

95 

20 

WATER,AIR,FOOD 

3/10/2010 

95 

7 

WATER,AIR,FOOD 

2/10/2010 

95 

3 

WATER,  AIR,  FOOD 

2/17/2010 

95 

3 

WATER, AIR, FOOD 

2/24/2010 

95 

3 

WATER, AIR, FOOD 

2/3/2010 

95 

2 

WATER,AIR,FOOD 

3/3/2010 

95 

2 

WATER,AIR,FOOD 

ALL 

65 

67 

PEOPLE,CHILD  REN,  POST 

1/20/2010 

65 

35 

P  EOF  LE,CH  1 LD  R  EN,  P  GST 

1/13/2010 

65 

19 

PEOPLE,  CHILDREN,POST 

1/27/2010 

65 

S 

P  EOP  LE,CH  1  LD  R  EN,  P  OST 

2/10/2010 

65 

7 

PEOPLE, CHILDREN, POST 

3/3/2010 

65 

4 

PEOPLE, CHILDREN, POST 

2/24/2010 

65 

3 

PEOPLE, CHILDREN, POST 

2/3/2010 

65 

2 

PEOPLE, CHIILDREN,POST 

2/17/2010 

65 

1 

PEOPLE, CHILDREN, POST 

3/10/2010 

65 

1 

PEOPLE, CHILDREN, POST 

3/21/2010 

65 

1 

PEOPLE, CHILDREN, POST 
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Themes  sorted  for  the  first  week  (1/13/2010  to  1/19/201)  and  the 
second  week  (1/20/2010  to  1/26/2010) 


Even:  Date  Sort 

Theme  Id 

LLA  Counts 

Theme  Keywords 

1/13/2010 

ALL 

630 

1/13/2010 

95 

37 

WATER,AIR,FOOD 

1/13/2010 

37 

25[HTGH7RaTE5 

1/13/2010 

5 

22' 

AiKLKAh  1  tbKUUI\IU,Abbib  1  sUKUbKbUPUbl\Jb  LUIVIhUH  1 

1/13/2010 

6 

21 

A  PAN,  CONTACT,  USER 

1/13/2010 

82 

21 

SUPPORT,PERSONNEL,  CLIENT, UNIT 

1/13/2010 

65 

19 

PEOPLE, CHILDREN, POST 

1/13/2010 

people 

18 

HAITI  .OFFICER,  PRESIDENT 

1/13/2010 

64 

18 

PAP, EMERGENCY,  ASSESSMENT,  ASSISTANCE 

1/13/2010 

79 

IS 

STAFF,VERSION,BATTLE,  LOGS,  RECORDS 

1/13/2010 

69 

IS 

PROVIDE,  DEAD,NEEDED 

1/13/2010 

39 

17 

HOSPITALBED,  HOUSE 

Even:Date Sort 

Theme  Id 

LLA  Counts 

Theme  Keywords 

1/20/2010 

95 

39 

WATER ,  A 1 R ,  F  O  O  D 

1/20/2010 

65 

35 

PEOPLE, CHILDREN, POST 

1/20/2010 

people 

33 

HA  ITLOFFICER,  PRESIDENT 

1/20/2010 

46 

32 

PATIENTS, LONG, CARE 

1/20/ 20 10 

29 

26 

EARTHQUAKE, LEFT, MASSIVE, QUAKE 

1/20/2010 

54 

25 

MEDICALTI  ME,  LIMITED 

1/20/2010 

41 

24 

INFORMATION, COMMUNITY,  REQUESTS 

1/20/2010 

6 

23 

A  PA  N,  CO  NT A  CT,  US  E  R 

1/20/2010 

70 

23 

RELIEF, FIELD, EFFORT 

1/20/2010 

72 

23 

RESPONSE,  HELICOPTER, PART 

•  Important  issues  were  heavily  discussed  and  collaborated  in  the  time  of 
frame  of  two  weeks. 

-  The  theme  “HIGH,  RATES”  was  related  to  the  fact  that  the  US 
government  provided  the  analysis  and  information  about  high  fatalities, 
high  risks  and  high  disease  rates  that  might  be  expected  after  the  event. 
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Lexical  links  for  the  theme  associated  with  keywords  "AIRCRAFT, 
GROUND,  ASSIST,  ORDERED,  USNS_COMFORT" 


•  Measuring  like-mindedness.  Like-minded 
organizations  do  use  many  of  the  same 
terminologies  in  their  communications,  like- 
mindedness  is  closer  in  meaning  to  like- 
wordedness 


•  LLA  often  places  the  different 
terminologies  such  as  “CVN”  and  “Carl_ 
Vinson”  in  different  components  of  word 
hubs  but  grouped  together 
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DISE 


The  overlapping  of  top  ten  themes  sorted  by  the  LLA 
counts  overall  (ALL)  and  the  first  and  second  weeks 


Theme  Keywords 

Theme  Keywords 

WATER,  AIR,  FOOD 

WATER,  AIR,FOOD 

PEOPLE,  CHILDREN,  POST 

HIGH,  RATES 

APAN, CONTACT, USER 

AIRCRAFT,GROUND,  ASSIST,  ORDERED,  USNSjIOMFORT 

HAITI,OFFICER,  PRESIDENT 

A  PAN,  CONTACT,  USER 

PAP, EMERGENCY, ASSESSMENT, ASSISTANCE 

Weekl 

SUPPORT, PERSONNEL, CLIENT, UNIT 

INFQRMATION,COMMUNITY,  REQUESTS 

WATER, AIR, FOOD 

M  ED  ICAL/TI  ME,  LIMITED 

PEOPLE,  CHILDREN,POST 

E  A  RTH  QUA  K  E,  LE  FT,  MASS  1 V  E,  QUA  K  E 

HAITI,  OFFICER, PRESIDENT 

PATIENTS,  LONG,  CARE 

PATIENTS,  LONG,  CARE 

\\\ 

REUEFpFIELD, EFFORT 

Week2 

EARTHQUAKE, LE=T, MASSIVE, QUAKE 

The  LLA  counts  are  measures  to  sort  out  the  main  and  important  topics  and 
themes 

•  6  out  of  10  top  themes  (60%)  in  the  top  themes  overall  are  also  in  the  top 
themes  of  the  first  two  weeks. 

•  7  of  10  top  themes  in  the  first  two  weeks  are  also  in  the  top  overall  themes 
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Extracting  Social  Networks 


1 

Thread 

Level 

Time 

User 

Organization 

Content 

43 

5722 

2 

1/14/20 10 

2205 

DoD  U  S  Southern 

re:  attention! ! !  notice  to  airmen  notam  "  That  is  correct.  In  orderto  gain  aerial  access 

44 

5722 

2 

1/15/2010 

3342 

Department  of 

re:  attention! ! !  notice  to  airmen  notam  "  1  suggest  that  MTPP  be  used  as  a  tag  for  all 

45 

5729 

1 

1/14/2010 

3645 

DoD  Air  Force 

pap  ground  support  1  noticed  this  morningthat  there  was  a  60K  main  deck  loader  at 

46 

5736 

1 

1/14/2010 

2205 

DoD  U  S  Southern 

port-au-prince  harbor  port  damage  photos  "  []  []  []  []  []  []  11 

47 

5737 

1 

1/14/2010 

2205 

DoD  U  S  Southern 

rfi  on  behalf  of  mlgp  haiti "  Is  there  any  info  on  a  USCG  helo  flight  inbound  from  Santo< 

48 

5737 

2 

1/14/2010 

2112 

DoD  US  Southern 

rfi  on  behalf  of  mlgp  haiti "  Heather,  dont  know,  but  we  should  send  invite  usingthe 

49 

5737 

3 

1/14/2010 

2205 

DoD  U  S  Southern 

rfi  on  behalf  of  mlgp  haiti  Yes!  Lets  do  it.  can  you  get  them  the  directions  or  do  you 

it) 

5737 

4 

1/15/2010 

2112 

DoD  US  Southern 

rfi  on  behalf  of  mlso  haiti "  Heather.  I  dont  want  to  stern  out  of  mv  lane.  I  think  it 

51. 

5741 

1 

1/14/2010 

3645 

DoD  Air  Force 

mtpp  airfield  sitrep  Can  anyone  provide  any  information  regardingthe  Airfield 

52 

5741 

2 

1/15/2010 

3342 

Department  of 

re  pp  ar  e  c  sitrep  '  suggest  that  IVTPP  be  used  as  a  tag  "ora  \QTAIVs.  b  og  posts. 

52 

5741 

2 

1/14/2010 

3545 

SSSG 

mtpp  an  ecs  trep  '  South  err  Command  -ress  Re  ease.  QUOTE  'are  c  has  power 

54 

5741 

3 

1/15/2010 

4038 

Teiecommunicatio 

mtpp  airfield  sitrep  "  Both  should  be  used,  since  Civilian  Operators  use  NOTAMs  as  j 

■ 

A2A} 

A 

1/15/2010 

3342 

Department  of 

mtpp  airfield  sitrep  Good  point. 

56 

5743 

1 

1/14/2010 

2406 

Integrated  ICT 

usar  activity  as  o 

57 

5744 

1 

1/14/2010 

2406 

Integrated  ICT 

logistical  activity 

58 

5745 

1 

1/14/2010 

2406 

Integrated  ICT 

coordination  act 

59 

5746 

1 

1/14/2010 

2406 

Integrated  ICT 

situation  activity 

List  of  threads  of 
aiscussions 


i 

10022 

10023 

10033 

10100 

10113 

10393 

10406 

10432 

10473 

10484 

79 

M 

1 

taryUSANORTHCOM_ 

taryUSANORTHCOMJoint 

0 

0 

0 

0 

0 

0 

0 

■^i  nreaas„ 

80 

M 

1 

taryUSANationalDefenseUniversity_ 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

81 

M 

1 

taryUSANavy_ 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

82 

M 

1 

tarvUSANavy_Command 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

85 

M 

1 

taryUSANavyJnformationManagement 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

84 

M 

1 

taryUSANavyJnfrastructure 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

85 

M 

1 

taryUSANavyJntelligence 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

86 

M 

1 

ta  ry  U  S  A  N  a  vy_J  o  int 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

87 

M 

1 

taryUSANavyJointIT 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

88 

M 

1 

taryUSANavy_Languagelnstitute 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

89 

M 

1 

taryUSANavy_Planning 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

9C 

M 

1 

taryUSAPACOM_ 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

91 

M 

1 

taryUSAPACOM_APAN 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

92 

M 

1 

taryUSAPACOMJnformationManagement 

jj 

0 

0 

0 

0 

0 

0 

0 

0 

0 

93 

M 

1 

taryUSAPACOMJoint 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

94 

M 

1 

taryUSAPACOM_Planning 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

95 

M 

1 

taryUSAPACOM_Simulation 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

96 

M 

1 

taryUSASOUTHCOM_ 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

97 

M 

1 

taryUSASOUTHCOM_Command 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

98 

M 

1 

taryUSASOUTHCOM_GIS 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

99 

M 

1 

taryUSASOUTHCOMJnformationManagement 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

ICC 

M 

1 

taryUSASOUTHCOMJntelligence 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

L01 

M 

1 

taryUSASOUTHCOM  Joint 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

182 

M 

1 

taryUSASOUTHCOM_ManagementSOUTHCOM 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

L  8  5 

M 

1 

taryUSASOUTHCOM_Partnering  ^ 

r  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

184 

M 

1 

taryl)SASOUTHCOM_Planning  r  ll7QtinnQ 

taryUSASOUTHCOM_Security  V/l  LjQl  IIZ_ClUL  1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

L  8  5 

M 

1 

1  0 

0 

0 

0 

0 

0 

0 

0 

0 

L06 

M 

1 

tarvUSASOUTHCOM  SpecialOperations 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Distributed  Information  Systems  Experimentation  (DISE) 


Naval  Postgraduate  School 


DISE 


Social  Networks  Associated  with  the  Theme 

"WATER,  AIR,  DATA" 


•  Light  blue  links  are  social  network  links  that  also 
overlap  in  the  semantic  networks  -  indications  of 
synergy 
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Extract  Semantic  Networks 


DISE 


)rivateUSACopernicusEnergylnc_Energy  MilitaryUSAAnwEngineering  5  ;  INFORMATION— (189.00) 

)rivateUSACopernicusEnergylnc_Energy  MilitarySouthAfricaArmyZA^  5  ;  INFORMATION— (189.00) 

)rivateUSACopernicusEnergylnc_Energy  MilitaryUSA50UTHC0M_  5  ;  INFORMATION— (189. 00) 

JrivateusACopernicusEnergyinc_Energy  Mi  1  "iTarycanadaNaTi onal  Def enseCA_  5  ;  information— (189. 00) 

’rivateusACopernicusEnergyincEnergy  Mi  1  i t aryusAPACOM_si mul  at i on  5  ;  information-community (189. 00) 

JrivateusACopernicusEnergyinc_Energy  Government usadod_  5  ; small— (208.00) 

JrivateusACopernicusEnergyinc_Energy  MilitaryusASOUTHCOMjianagementsouTHCOM  5  ; information- (189. 00) 

JrivateusACopernicusEnerqyinc_Enerqy  PrivateusARapidResponseconsulti nq_Telecommuni cationsDi sasterResponse  5  ; information— (189. 00) 
JrivateusACopernicusEnergyinc_Energy  Mi  1  itaryusANavy_irrf ormati onManagement  5  [power— (802.00)  | 

’rivateUSACopernicusEnergylncEnergy  NGOUSAWorl dear es_DisasterRes pons eNetworks  !  ’ IMI-OkMAI  1UN— (189. 00) 
’rivateuSACopernicusEnergylncEnergy  Mi  1  itaryUSADoD_simul ati onJoi nt  5  ; SMALL— (208.00) 

)rivateUSACopernicusEnergylnc_Energy  Mi  1  i t aryUSAPACOM_APAN  5  ;  INFORMATION-REQUESTS (189. 00) 

)rivateUSACopernicusEnergylnc_Energy  Mi  1  itaryUSADoD_lntel  1  i  genceJoi nt  5  ;  INFORMATION— (189.00) 

)rivateUSACopernicusEnergylnc_Energy  PrivateUSAPaxMondial_Reconstruction  5  ; INFORMATION — CONFIDENTIAL (139. 00) 

uilllUryUSMbULUIJulfiL  HiniLiryUSAbUbJHruUlLlUUShdrltig  i  |SUSAN-lV.UU) - 

<ii  1  i t aryusAD0D_iT J oi  nt  Mi  1  itaryusAEur opeancommand_  5  ;  SUSAN— (68.00) 

JrivateusAEmicuscom_DisasterResponse  MilitaryusASOUTHCOM_security  5  ; bobbi — (1.00); jacmel— (111.00) 

3rivateusAEmicuscom_DisasterResponse  MilitaryusAArmy_Engineering  5  ;  background— (189.00) 

Dri  vat  eusAEmicuscoiiLDi  sasterResponse  NGOUSAProjectKiDincjvdoption  5  ;  food — (3  02. 00);  food— water  (3  02.00) ;  jacmel— (111.00) 
DrivateusAEmicuscom_DisasterResponse  MilitarysouthAfricaArmyaL  5  ; food —  (802. 00); request— (102.00) 

Dri vat eusAEmicuscom_Di sasterResponse  MilitaryusASOUTHCOM_  5 

;GEAR— (97. 00)  ;FOOD—pQ2. 00)  ;GOOGLE—p2  5. 00);  REQUEST— (102.  QQ);  REQUEST— EXPLANATION  (1Q2. 00);  TRANSPORTING— (111.  00);  BACKGROUND— (189.  QQ) 
3rivateUSAEmicuscom_DisasterResponse  Mi  1  i  t  ar  yUSAPACOM_si  mul  at  i  on  5  ;  FOOD— (302.00) 

3r i vat eUSAEmicuscom_Di sasterResponse  GovernmentUSADoD_  5 

;  FOOD-DRINKING,  CONTAMINATED  (3  02. 00);  FOOD-WATER  (3  02. 00);  FOOD-WATER,  DRINKING,  CONTAMINATED  (3  02. 00);  FOOD- (3  02. 00) 

-  ■  ■  -  -  -■  j - - -  ■■  - - ■  ■-  ■  ■  r ■  _ /-lA-i  AA\ _ 


•  Use  shared  word  hubs 
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Semantic  Networks  (Green)  Overlaid  with  the  Social 
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Blue:  Social  Networks 
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•  Red  circles  show 
organizations  who  have 
semantic  links  but  no 
current  social 
connections  => 
indicates  potential 
collaborations 
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•rivateuSADevel opmentTheMi 1 1 eni umGrouplnt 1_Managementconsu1ti ng  Mi 1 itaryuSAPACOM_inf ormati onManagement  5  ; LONG — term(200. 00) 
TivateuSADeve1opmentTheMi11eniumGrGuplnt1 Managementconsulting  Mil itaryuSADoD.lntell i genceJoi nt  5  ; FRENCH — (198. 00) 
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rivateuSASTrategos_securiTyconTracTing  PrivateuSAERDASlnc_Gls  5  ; hesitate — (254.00) 
r i vat euSAStr at egos_securitycontr acting  Government USADoD_  5  ; potable — (302. 00); potable — water(302. 00) 

r i vat eusAStr at egos_securitycontr acting  Mi  1 itaryusANavy_inf ormati onManagement  5  ; potable — (302. 00); fresh — water(3Q2. 00) 
ri vat euSAStr at ecios.securitvcontr act inq  Government usadod_it  5  ; potable — water (3 02. 00); hesitate — (254. 00) 

li  lit aryU5A56UTH&M_5peci alterations  Government U3ABH5_Cn saster Response  b  ;  U5«— *757 foil) - N  * - 

iovernmentuSAUSAlD_Budget  Mi  1 itaryuSAPACOM.Pl anni ng  5  ; USG — (111.00) 
lover nmentuSAUSAlD_Budget  Mi  1 itarySouthAf ri caArmyZA.  5  ; USG — (111.00) 
lover nmentusAUSAiD_Budget  NGOHaiti HaitiMi ssi ori esofcharity_  5  ; great — see(198.00) 
lover nmentuSAUSAlD_Budget  Educati onusADeltastateuniversity.Gls  5  ; USG — (111.00) 
lover nmentuSAUSAlD_Budget  Mi  1 it aryuSASOUTHCOM_lnf ormati onManagement  5  jgreat — (198.00) 
li  1  it  aryuSADoD.int  ell  i  genceJoi  nt  Mi  1 itaryuSAArmy_Engi neeri ng  5  ; information — (189.00) 
ii1itaryusAD0D_inteni genceJoi nt  PrivateusAERDASinc_Gis  5  ; response — (182.00) 
lilitaryuSADoD_inte11i genceJoi nt  Government uSADoD_LanguagelnstituteJoi nt  5  ; relief — (257.00) 
li1itaryuSADoD_lntelli genceJoi nt  PrivateuSACoperni cusEnergylnc_Energy  5  ; information — (189.00) 

lilitaryuSADoD.lnte'I'li genceJoi nt  Pr i vat eUSAEmi cuscom_Di saster Response  5  ;GEAR — (97. 00); GOOGLE — (225. 00) ; TRANSPORTING — (111. 00) 
iilitaryusADoD_mteni  genceJoi  nt  NGOUSAProj  ectKiDinc_Adopti  on  5  ;  response — (182.00) 
li1itaryusAD0D_inte11i genceJoi nt  Mi  1 itaryuSAArmy.  5  ; relief — (2  57.00) 

li  1  itaryuSADoD.lnt  ell  igence  Joint  Mi  1 itarysouthAfri caArmyz/c_  5  ; INFORMATION — (189. 00); RESPONSE — (182. 00) 
li 1 itaryusADoD.int ell igence Joint  Mi  1 itaryuSASOUTHCOM_  5 

INFORMATION — (189. 00); GEAR — (97. 00); RELIEF — (257. 00); RELIEF — EFFORT(257. 00) ; GOOGLE — (22 5 . 00) ; TRANSPORTING — (111. 00) 
lilitaryuSADoD_intelli genceJoi nt  Mi  1 itarycanadaNati onal Def enseCA_  5  ; information — (189.00) 

li 1itaryusADoD_int ell igence Joint  Mi  1 itaryusAPACOM_simu1 ati on  5  ; information — community (189. 00); relief — (2  57. 00) 
li1itaryuSAD0D_lnteTli genceJoi nt  Independent.  5  ; RELIEF — (2 57. 00) ; RESPONSE — (182.00) 

lilitaryusADoD.inteVli genceJoi nt  Government usadod.  5  ; relief — (257. 00); people — animals(267. 00); people — (267. 00) 
li 1itaryusAD0D_int ell igence Joint  Mi 1 itaryusAS0UTHC0M_ManaqementS0UTHC0M  5  ; information— (189. 00) 
lilitaryuSADoD.lntelli genceJoi nt  NGOHaiti HaitiMi ssi ori esofcharity.  5  ; FRENCH — (198.00) 
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A  holistic  view  of  social  network  with  semantic 
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Measure  Synergy  Using  Overlapping 
Connections  in  the  Social  networks  and 
Semantic  networks 


Sorted  according  to  the  Synergy 

Ratio=Overlapped  (Light  Blue)/Social  (Blue  and  Light  Blue) 


Social(Bliie 
and  Lis  hr 
Bine] 

Overlapped 

{Lisht 

Blue) 

Semantic 

(Green) 

Involve  me  nr 
(Red) 

Syner gy 

Top  10  for  rhe 
First  or 

Second  Week 

HIGH,  RATES 

16 

16 

20 

9 

100% 

Yes 

PEOPLE,  CHILDREN,  POST 

140 

124 

308 

26 

89% 

Yes 

A  PAN,  CONTACT,  USER 

1S6 

160 

272 

38 

86% 

Yes 

EARTH  QUAKE,  LEFT,  MASSIVE,  QUAKE 

58 

48 

184 

20 

83% 

Yes 

AIRCRAFT,GROUND,ASSIST,ORDERED,USNS  COMFORT 

116 

96 

102 

21 

83% 

Yes 

SUPPORT,PERSONNEL,CLIENT,UNIT 

90 

74 

128 

23 

82% 

Yes 

MEDICAL,  TIME,, LIMITED 

116 

93 

170 

26 

80% 

No 

WATER,  A 1 R,  F  0  0  D 

264 

206 

511 

36 

78% 

Yes 

RELIEF, FIELD.EFFORT 

186 

142 

386 

32 

76% 

No 

INFORMATION,  COMMUNITY,  REQUESTS 

234 

176 

296 

35 

75% 

No 

PAT  ENTS.  LONG.  CARE 

104 

74 

262 

25 

71% 

Yes 

HAITI, OFFICER, PRESIDENT 

130 

90  298 

33 

69% 

Yes 

PAP,  EMERGENCY, ASSESSMENT, ASSISTANCE 

252 

174  312 

37 

69% 

NO 

•  There  are  fewer  organizations  involved  in  the  two  weeks  than  in  the  overall  forum 

•  “Synergies”  seem  to  go  above  the  average  (>80%)  for  some  specific  and  time-critical  themes,  for  example, 

“HIGH, RATES”,  “PEOPLE, CHILDREN, POST”,  “APAN, CONTACT, USER”, 

“AIRCRAFT, GROUND, ASSIST, ORDERED, USNS_COMFORT”,”  SUPPORT, PERSONNEL, CLIENT, UNIT”,  and 
“EARTHQUAKE, LEFT, MASSIVE, QUAKE” 

•  There  are  some  themes  with  relatively  low  synergies  overall  and  on  the  two  weeks,  for  example,  “WATER,  AIR, 
FOOD”,  “PATIENTS,  LONG,  CARE”, ’’HAITI,  OFFICER,  PRESIDENT”.  This  may  indicate  lack  of  knowledge  or  expertise 
in  these  areas  in  general. 

•  Comparing  the  two  weeks  and  the  average,  the  synergies  among  organizations  seem  correlated  with  the  number  of 
organizations  involved  in  the  collaborations:  the  themes  with  fewer  organizations  involved  have  higher  synergies. 
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•  Explored  a  real-life  data  set,  i.e.  APAN  forum  data  in  the  Haiti  interagency 
operation  via  Lexical  Link  Analysis 

-  Discovered  the  themes  and  sorted  them  according  to  the  importance  and  time 

-  Extracted  social  and  semantic  networks  of  participating  organizations 

•  Showed  the  LLA  method  to  examine  historic  data  and  improve  future  operations 

-  Identify  partners  for  collaboration  or  areas  of  overlap 

-  Examined  the  aspects  of  the  situational  awareness  of  an  interagency  operation 

-  Monitored  and  visualized  organizational  behavior 

-  Automation  allows  performing  this  task  easier  and  more  frequent  -  and  in  near  real-time 
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Lexical  Link  Analysis  (LLA) 

Summary 
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•  Lexical  Analysis  (LA  wiki,  2009)  is  a  form  of  text  mining 

-  Learns 

-  Dynamically  updates  word  and  context  associations  with  added  data 

•  Link  analysis 

-  Like  network  analysis,  explores  and  illustrates  associations  between  objects 

•  Lexical  Link  Analysis  (LLA) 

—  Combines  data  mining  with  network  analysis 

-  Can  dynamically  identify,  assess,  and  predict  trends,  patterns  and  features 

•  Data  mining  tools 

-  Analyzes  structured  and  unstructured  data 

—  Confirms  previously  known  patterns,  or  discovers  new  patterns 

-  Implements  innovative  visualization  and  navigation  techniques 

•  Facilitates  concept  discovery,  automated  classification,  and  categorization  of 
unstructured  documents 
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Agent  Learning 
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